Regression Analysis of Quantity Data with Exact Zeroes
نویسنده
چکیده
Measurements of the magnitude or duration of physical phenomenon have the property that they are positive and continuous, except for the possibility of exact zeroes when the phenomenon does not occur. Such data cannot be transformed to normality by power transformations or any other means, and special treatment of the zero observations is usually required. The approach of this paper is to model quantity data using a family of exponential family distributions intermediate between the Poisson and the gamma families. These families have the feature of power mean-variance relationships with exponent between one and two. Regression modelling is possible using the established framework of generalized linear models. With suitable assumptions this approach allows the information in both the zero and positive observations to contribute to the estimation of all parts of the model.
منابع مشابه
Structural zeroes and zero-inflated models
SUMMARY In psychosocial and behavioral studies count outcomes recording the frequencies of the occurrence of some health or behavior outcomes (such as the number of unprotected sexual behaviors during a period of time) often contain a preponderance of zeroes because of the presence of 'structural zeroes' that occur when some subjects are not at risk for the behavior of interest. Unlike random z...
متن کاملA flexible zero-inflated model to address data dispersion
Excess zeroes are often thought of as a cause of data over-dispersion (i.e. when the variance exceeds the mean); this claim is not entirely accurate. In actuality, excess zeroes reduce the mean of a dataset, thus inflating the dispersion index (i.e. the variance divided by the mean). While this results in an increased chance for data over-dispersion, the implication is not guaranteed. Thus, one...
متن کاملRegression calibration when foods (measured with error) are the variables of interest: markedly non-Gaussian data with many zeroes.
Regression calibration has been described as a means of correcting effects of measurement error for normally distributed dietary variables. When foods are the items of interest, true distributions of intake are often positively skewed, may contain many zeroes, and are usually not described by well-known statistical distributions. The authors considered the validity of regression calibration ass...
متن کاملCount Data Models in SAS ®
Poisson regression has been widely used to model count data. However, it is often criticized for its restrictive assumption of equi-dispersion, meaning equality between the variance and the mean. In real-life applications, count data often exhibits over-dispersion and excess zeroes. While Negative binomial regression is able to model count data with over-dispersion, both Hurdle (Mullahy, 1986) ...
متن کاملComparison of classic regression methods with neural network and support vector machine in classifying groundwater resources
In the present era, classification of data is one of the most important issues in various sciences in order to detect and predict events. In statistics, the traditional view of these classifications will be based on classic methods and statistical models such as logistic regression. In the present era, known as the era of explosion of information, in most cases, we are faced with data that c...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007